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Abstract. Suppose / is a real univariate polynomial of degree D with exactly 4 monomial 
terms. We present an algorithm, with complexity polynomial in log D on average (relative 
to the stable log-uniform measure), for counting the number of real roots of /. The best 
previous algorithms had complexity super-linear in D. We also discuss connections to sums 
of squares and ^l-discriminants, including explicit obstructions to expressing positive definite 
sparse polynomials as sums of squares of few sparse polynomials. Our key tool is the 
introduction of efficiently computable chamber cones, bounding regions in coefficient space 
where the number of real roots of / can be computed easily. Much of our theory extends to 
n-variate {n + 3)-nomials. 

1. Introduction 

Counting the real solutions of polynomial equations in one variable is a fundamental in- 
gredient behind many deeper tasks and applications involving the topology of real algebraic 
sets. However, the intrinsic complexity of this basic enumerative problem becomes a mys- 
tery as soon as one considers the input representation in a refined way. Such complexity 
questions have practical impact for, in many applications such as geometric modelling or the 
discretization of physically motivated partial differential equations, one encounters polyno- 
mials that have sparse expansions relative to some basis. So we focus on new, exponential 
speed-ups for counting the real roots of certain sparse univariate polynomials of high degree. 

Sturm sequences |Stu35] . and their later refinements |Hab48t IBPR06] . have long been a 
centrally important technique for counting real roots of univariate polynomials. In combi- 
nation with more advanced algebraic tools such as a Grobner bases or resultants |GKZ94[ 
IBPROGj . Sturm sequences have even been applied to algorithmically study the topology of 
real algebraic sets in arbitrary dimension (see, e.g., |BPR06[ Chapters 2, 5, 11, and 16]). 
However, as we will see below (cf. Examples II. II and II .2p . there are obstructions to attaining 
polynomial intrinsic complexity, for sparse polynomials, via Sturm sequences. So we must 
seek alternatives. 

More recently, relating multivariate positive polynomials to sums of squares has become 
an important algorithmic tool in optimizing real polynomials over semi-algebraic domains 
|Par03t ILasOQj . However, there are also obstructions to the use of sums of squares toward 
speed-ups for sparse polynomials (see Theorem ll.Sl below). 

Discriminants have a history nearly as long as that of Sturm sequences and sums of 
squares, but their algorithmic power has not yet been fully exploited. Our main result 
is that A- discriminants |GKZ94] yield an algorithm for counting real roots, with average- 
case complexity polynomial in the logarithm of the degree, for certain choices of probability 
distributions on the input (see Theorem 11.41 below) . The use of randomization is potentially 
inevitable in light of the fact that even detecting real roots becomes NP-hard already for 
moderately sparse multivariate polynomials |BRS09t IPRT09] . 
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1.1. Prom Large Sturm Sequences to Fast Probabilistic Counting. The classical 
technique of Sturm Sequences |Stu35l IBPROGj reduces counting the roots of a polynomial 
/ in a half-open interval [a, b) to a gcd-like computation, followed by sign evaluations for 
a sequence of polynomials. A key difficulty in these methods, however, is their apparent 
super-linear dependence on the degree of the underlying polynomial. Consider the following 
two examples (see also |RY05l Example 1]). 

Example 1.1. Setting f{xi) ■.=xl^'^^^^ — 2x\^^'^^^ + 1, the realroot command in Maple 1^ 
(which is an implementation of Sturm Sequences) results in an out of memory error after 
about 31 seconds. The polynomials in the underlying computation, while quite sparse, have 
coefficients with hundreds of thousands of digits, thus causing this failure. On the other hand, 
via more recent work [BRSOQj . one can show that when c>0 and g{xi) : = xf^™^ — cxl^^"^^^ + 1 , 
g has exactly 0, 1, or 2 positive roots according as c is less than, equal to, or greater than 
(121393121393 i96'?i8MMi8)i/3i78ii - 1.944.... In particular, our f has exactly 2 positive roots. (We 
discuss how to efficiently decide the size of monomials in rational numbers with rational 
exponents in Algorithm \2.15\ of Section l273[ below.) o 

Example 1.2. Going to tetranomials, consider f{xi) ■.= axY"^^'^^ — aj^ooos _|_ ^^50004 _ yjHh 
a,b > 0. Then (via the classical Descartes' Rule of Signs |RS02[ Cor. 10.1.10, pg. 319]j 
such an f has exactly 1 or 3 positive roots, but the inequalities characterizing which (a, b) 
yield either possibility are much more unwieldy than in our last example: there are at least 
2, involving polynomials in a and b having tens of thousands of terms. In particular, for 
{a,b) = (2, i), Sturm sequences on Maple 14 result in an out of memory error after about 
122 seconds, o 

We have discovered that ^-discriminants, reviewed in Section [21 enable algorithms with 
complexity polynomial in the logarithm of the degree. 

Definition 1.3. For any x={xi, . . . , Xd) € C^, we define Log|a;| := (log |a;i|, . . . , log \xd\) and 
let the stable log-uniform measure on (resp. TJ^) be the probability measure v (resp. v' ) 
defined as follows: iy{S):= lim ) (resp. u'{S):= lim #(Log|-5'|n{-M^...,M} 

where fi denotes the standard Lebesque measure on Mf^ and #(■) denotes set cardinality, o 

Note that the stable log-uniform measure is finitely additive (but not countably additive), 
and is invariant under refiection across coordinate hyperplanes. 

Theorem 1.4. LetQ<a2<a^<a4^ = D he positive integers, 

/(Xi) : = Ci + C2xf + C3xf + 

with Ci, . . . , C4 being independent stable log-uniform random variables chosen from M (resp. 
TL), and define /i:=log(2+maXj |q|). Then there is a deterministic algorithm, with complexity 
polynomial in \o<gD (resp. h and logD), that computes a number in {0,1,2,3} that, with 
probability 1, is exactly the number of real roots of f . The underlying computational model 
is the BSS model over M (resp. the Turing model). 

The key idea is that while the regions of coefficients determining polynomials with a constant 
number of real roots become more complicated as the number of monomial terms increases, 
one can nevertheless efficiently characterize large subregions — chamber cones — where the 



^Running on a 16GB RAM Dell PowerEdge SC1435 departmental server with 2 dual-core Opteron 2212HE 
2GI1Z processors and OpenSUSE 10.3. 
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number of real roots is very easy to compute. This motivates the introduction of probabihty 
and average-case complexity. The c/l- discriminant allows one to make this approach com- 
pletely precise and algorithmic. In fact, our framework enables us to transparently extend 
Theorem 11.41 to n-variate (n -|- 3)-nomials (see Theorem 13.181 of Section 13731) . 

Our focus on the stable log-uniform measure simplifies our development and has some 
practical motivation: when one considers A^-bit floating-point numbers with uniformly ran- 
dom exponent and mantissa, taking — )■ +oo and suitably rescaling yields exactly the 
stable log-uniform measure on Z. The stable log-uniform measure has also been used in 
work of Avendano and Ibrahim to study the expected number of roots of sparse polynomial 
systems over local fields other than M [Alllj . 

It is of course quite natural to ask how the expected complexity in Theorem 11.41 behaves 
under other well-known measures, e.g., uniform or Gaussian. Unfortunately, the underly- 
ing calculations become much more complicated. On a deeper level, it is far from clear 
what a truly "natural" probability measure on the space of tetranomials is. For instance, 
for non-sparse polynomials, it is popular to use specially weighted independent Gaussian 
coefficients since the resulting measure becomes invariant under a natural orthogonal group 
action (see, e.g., |Kos88t ISSQGj IBSZOO] ). However, we are unaware of any study of the types 
of distributions occuring for the coefficients of polynomials actually occuring in physical 
applications. 

The speed-ups we derive actually hold in far greater generality: see |BRS09[ IPRTOQj for 
the case of n-variate (n -|- fc)-nomials with /c<2. Section [3] here for connections to ri-variate 
{n -\- 3)-nomials, the forthcoming paper [AARll] for the general univariate case, and the 
forthcoming paper [PRRTll] for chamber cone theory for n x n sparse polynomial systems. 
One of the main goals of our paper is thus to illustrate and clarify the underlying theory in 
a non-trivial special case. We now state our second main result. 

1.2. Sparsity and Univariate Sums of Squares. Recent advances in semidefinite pro- 
gramming have produced efficient algorithms for finding sum of squares representations of 
certain nonnegative polynomials, thus enabling efficient polynomial optimization under cer- 
tain conditions. When the input is a sparse polynomial it is then natural to ask if there 
is a sum of squares representation that also respects sparseness. Indeed, it is well-known 
that a nonnegative univariate polynomial can always be written as a sum of two squares 
of, usually non-sparse, polynomials (see, e.g., |Pou71] for refinements). The following result 
demonstrates that a sparse analogue is either unlikely or much more subtle. 

Theorem 1.5. There do not exist absolute constants i and m with the following property: 
Any trinomial f G ]R[xi] that is positive on M can be written in the form f = gf + ■ ■ ■ + gj, 
for some gi, . . . , giEM.[xi] with gi having at most m terms for all i. 

Were there to be a sufficiently efficient representation of positive sparse polynomials as sums 
of squares, one could then try to use semidefinite programming to find such a representation 
explicitly for a given polynomial. This in turn could yield an efficient reduction from deciding 
the existence of real roots to a (small) semidefinite programming problem, similar to the 
techniques of |Par03j . Our last theorem thus reveals an obstruction to this sums of squares 
approach. 

1.3. Related Approaches. The best known algorithms for real root counting lack speed- 
ups for sparse polynomials like the average-case complexity bound from our first main result. 
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For example, in the notation of Theorem ll.4l |LM01j gives an arithmetic complexity bound of 
0{D log^ D) which, via the techniques of |BPR06] . yields a bit complexity bound super-linear 
in h + D. No algorithm with complexity polynomial in logD (deterministic, randomized, or 
average-case) appears to have been known before for tetranomials. (See |HTZEKM09j for 
recent speed benchmarks of univariate real solvers.) 

As for alternative approaches, softening our concept of sparse sum of squares representa- 
tion may still enable speed-ups similar to Theorem 11.41 via semidefinite programming. For 
instance, one could ask if a positive trinomial of degree D always admits a representation as 
a sum of hg^'-^^D squares of polynomials with log^^^^D terms. This question appears to be 
completely open. 

Example 1.6. Observe that a quick derivative computation yields that 

/(xi):=xf - 2'=xi + 2^' - 1 
attains a unique minimum value of at x = 1. So this f is nonnegative. On the other 

hand, one can prove easily by induction that f{xi) = 2^^^^ Yl'i=o ¥ (^^f ~ ' ^^'^^ yielding 
an expression for f as a sum of O {log D) binomials with D = 2^. o 

Note also that while we focus on speed-ups that replace the polynomial degree D by log D 
in this paper, other practically important speed-ups combining semidefinite programming 
and sparsity are certainly possible (see, e.g., |Las06[ iKMOQj ). 

2. Background 

2.1. Amoebae and Efficient yi-Discriminant Parametrization. Let us first briefly 
review two important constructions by Gelfand, Kapranov, and Zelevinsky. 

Definition 2.1. Let c := (ci, . . . , Cm), let A = {ai, . . . , am} C Z" have cardinality m, and 
define the corresponding family of (Laurent) polynomials 

g^^:={cia;"i + ■ ■ ■ + c^x"™ | cGC'"}, 
where the notation X ^ . — X ' ' ' Xfi IS understood. When q 7^ for all i^{l, . . . , m} then 
we call A the support 0/ /(x) = ^™ CjX°% also using the notation Supp(/). o 

Definition 2.2. For any field K we let K* := K \ {0}. Given 
any g^C[xi, . . . we then define its amoeba, Amoeba(5'), to 
6e {Log|c| I c={ci,...,Cm)e{C*)"' and g{ci,...,Cm) = 0}. o 

Archimedean Amoeba Theorem, (weaker version of 
|GKZ94t Cor. 1.8]) Following the notation of Definition \2.2^ the 
complement of Amoeha{g) in is a finite disjoint union of 
open convex sets. ■ 

An example of an amoeba of a bivariate polynomial (see Exam- 
ple 12.51 below) appears in the right-hand illustration. While the 
complement of the amoeba (in white) appears to have 3 convex 
connected components, there are in fact 4: the fourth compo- 
nent is a thin sliver emerging further below from the downward 
pointing tentacle. 

Definition 2.3. Following the notation of Definition \2.1\ and letting f{x) := cix"'^ + • ■ ■ + 
Cmx"'"^ , we define Va — the ^-discriminant variety |GKZ94t Chs. 1 & 9-11] — to be the 
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closure of the set of all [ci : ■ ■ ■ : Cm] GP^ such that 

f - d.f - - df _Q 

dxi dXn 

has a solution in (C*)". We then define (up to sign) the ^-discriminant, Ayi eZ[ci, . . . , Cm], 
to be the (irreducible) defining polynomial ofVA when Va is a hyper surf ace. Finally, we let 
Vyi(]R) denote the real part ofVA- o 

Remark 2.4. The Va considered in this paper will all ultimately be hypersurfaces . o 



Reduction of N3t)la^(R] plotted on log paper, for the family 




Example 2.5. TaAim^ .A = {0, 404, 405, 808} , we see that 
3^ A consists simply of polynomials of the form f{xi) : = 
Ci + C2xf^^ + c^xf^^ + C4^x^^ . The underlying A- 
discriminant is then a polynomial in the Ci having 
609 monomial terms and degree 1604. However, while 
Aa is unwieldy, we can still easily plot the real 
part of its zero set Vyi(M) via the Horn-Kapranov 
Uniformization (see its statement below, and the 
illustration to the right), o 

The plotted curve above is the image of the real roots of Aa{c2, C4) ■.= Aa{1, C2, 1, C4) under 
the Log| ■ I map, i.e., the amoeba of Aa- Amoebae give us a convenient way to intro- 
duce polyhedral/tropical methods into our setting. For our last example, the boundary of 
Amoeba(Ayi) is contained in the curve above. 

yi-discriminants are notoriously large in all but a few restricted settings. For instance, 
the polynomial A{o,404,405,808} defining the curve above has the following coefficient for cf^^Ci: 
903947086576700909448402875044761267196347419431440828445529608410806270 

...[2062 digits omitted] 93441588472666704061962310429908170311749217550336. 

Fortunately, we have the following theorem, describing a one-line parametrization of Va- 



The Horn-Kapranov Uniformization. (See ||Kap91| , |PT05j . and pFS07[ Prop. 4.1].; 



Given A:={ai, . . . , a„i} CZ" with Va a hypersurface, the discriminant locus Va is exactly 
the closure of 

{[u.X'^^ : ■ ■ ■ : u^X''-] |n:=(ni,...,njGC-, Au = 0, E^i^i = 0, A= (Ai, . . . , G (C*)"}. ■ 

Thus, once we know the null-space of an (n + 1) x m matrix, we have a formula parametriz- 
ing Va- Recall that for any two subsets U,V O M^, their Minkowski sum U + V is 
{u + V \ u^U , veV}. Also, for any matrix M, we let M'^ denote its transpose. 

Corollary 2.6. Following the notation above, let A denote the (n + l) x m matrix whose i— 
column has coordinates corresponding to Ixa^, let B & be any real matrix whose columns 
are a basis for the right null-space of A, and define : — )■ via ip{t) := log 
Then Amoeba(Ayi) is the Minkowski sum of the row space of A and (p{C^). ■ 

If one is familiar with elimination theory, then it is evident from the Horn-Kapranov Uni- 
formization that discriminant amoebae are subspace bundles over a lower-dimensional amoeba. 
This is a geometric reformulation of the homogeneities satisfied by the polynomial Aa- 



Example 2.7. Continuing Example \2.5[ we observe that A - 



1111 
404 405 808 



has right 



null-space generated by the respective transposes 0/ (1, — 405, 404, 0) and (1,-2,0,1). The 
Horn-Kapranov Uniformization then tells us that Va is simply the closure of the rational 
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surface {[h + 12 : (-405ti - 2t2)\^°^ : AOU^X^^^ : tsA^"^] | h,t2eC,XeC*} C F^. Note that 
f and -^f have the same roots and that u ^ ^i/405 ^ well-defined bijection on M that 

1/405 



preserves sign. Note also that the roots of f and f(y) := ^/(^(ft) Vj differ only by a 

scaling when f has real coefficients, and that f is of the form 1 -\- C2y^^'^ + y"^^^ + c'^y^^^ . It then 
becomes clear that we can reduce the study o/ Va(IR) to a lower-dimensional slice: intersecting 
Vyi with the plane defined by ci = = 1 yields the following parametrized curve in C^; 

{ (^tte^ (frl)""'" ' T7^. (ItI)""'") I ^-^^^C}. In other words, the 
preceding curve is the closure of the set of all (c2, c'^ ) G (C*)2 such that 1 + 4 

^404 _|_ ^405 _|_ 

c'^x^^^ has a degenerate root in C*. Our preceding illustration of the image o/Vyi(M) within 
Amoeba(Ayi) (after taking log absolute values of coordinates) thus has the following explicit 
parametrization: 

{ (log |405ti + 2t2\ - log \t, + 12| - i log |404ti|, log jt^l + § log \t, + ^2! - fi log |404ti|) o 

A geometric fact about amoebae that will prove quite useful here is the following elegant 
quantitative result of Passare and RuUgard. Recall that the Newton polytope of a Laurent 
polynomial / G C[xf ^, . . . , x^^] is the convex hull olj^ the exponent vectors appearing in the 
monomial term expansion of /. 

Passare-Rullgard Theorem. |PR04[ Cor. 1] Suppose f EC[xf^ , xf^] has Newton polygon 
P. r/ien Area(Amoeba(/)) <7r2Area(P). ■ 

2.2. Discriminant Chambers and Cones, yi-discriminants are central in real root count- 
ing because the real part of Vyi determines where, in coefficient space, the real zero set of a 
polynomial changes topology. Recall that a cone in is any subset closed under addition 
and nonnegative linear combinations. The dimension of a cone C is the dimension of the 
smallest fiat containing C. 

Definition 2.8. Suppose A = {ai, . . . , am} CZ" has cardinality m and Vyi is a hypersurface. 
We then call any connected component C of the complement ofVA in F"^"^ \{ci ■ ■ ■ Cm = 0} a 
(real) discriminant chamber. Also let A denote the (n + 1) x m matrix whose i— column has 

\ A 

coordinates corresponding to 1 x a-i and let B be any real matrix B= \biA gM™^^ with j, 

Jd 

invertible. //log|C|i? contains an m- dimensional cone then we call C an outer chamber 
(ofVA)- All other chambers of Va o-i"^ called inner chambers (of Va)- Finally, we call 

the formal expression (ci, . . . , Cm)^ := (^c\^'^ ■ ■ ■ Cm'^ • • • , Ci'^ ■ ■ ■ Cm'^'^ a monomial change of 

variables, and we refer to images of the form (with C an inner or outer chamber) as 
reduced chambers, o 

It is easily verified that log |C'^| =log |C|-B. The latter notation simply means the image of 
log |C| under right multiplication by the matrix B. 

Example 2.9. The illustration from Example \2.5\ shows partitioned into what appear to 
be 3 convex and unbounded regions, and 1 non- convex unbounded region. There are in fact 
4 convex and unbounded regions, with the fourth visible only if the downward pointed spike 
were allowed to extend much farther down. So A = {0,404,405,808} results in exactly 4 
reduced outer chambers, o 



i.e., smallest convex set containing... 
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Note that exponentiating by any B as above yields a well-defined multiplicative homo- 
morphism from (M*)*" to (R*)'^ when B has rational entries with all denominators odd. In 
particular, the definition of outer chamber is independent of B, since (for the B considered 
above) Log|C^| is unbounded and convex iff Log|C'^*| is unbounded and convex, where B* 
is any matrix whose columns are a basis for the orthogonal complement of the row space of 
A. 

One can in fact reduce the study of the topology of sparse polynomial real zero sets to 
representatives coming from reduced discriminant chambers. A special case of this reduction 
is the following result. 



entries of B eQ^'^'^^^^^ have odd denominator, and 



is invertible. Also let /, (7 G 3\a \ V, 



A 



Lemma 2.10. (See |DRRS07| Prop. 2.17].) Suppose A G Tj"^ has n + 3 elements, A is 
not contained in any {n — l)-flat, A HQ has cardinality n for all facets Q of Convyi, all the 

A 

[B^ 

have respective real coefficient vectors c and c' with and c'^ lying in the same reduced 
discriminant chamber. Then all the complex roots of f and g are non-singular, and the 
respective zero sets of f and g in (M*)" are diffeotopic. In particular, for n = 1, we have 
that either f[x) and g{x) have the same number of positive roots or f{x) and g{—x) have 
the same number of positive roots. ■ 

2.3. Integer Linear Algebra and Linear Forms in Logarithms. Certain quantitative 
results on integer matrix factorizations and linear forms in logarithms will prove crucial for 
our main algorithmic results. 

Definition 2.11. Let Z"^*" denote the set ofnxm matrices with all entries integral, and let 
GL„(Z) denote the set of all matrices in Z"^" with determinant ±1 (the set o/unimodular 
matrices). Recall that any n x m matrix [uij] with Uij = for all i > j is called upper 
triangular. 

Given any MgZ"^™-, we then call an identity of the form UM = H, with H= [hij] eZ"^'" 
upper triangular and U G GhnC^), a Hermite factorization of M. Also, if we have the 
following conditions in addition: 

(1) hij > for all i, j . 

(2) for all i, if j is the smallest j' such that hiji^O then h^ > hiij for all i' <i. 
then we call H the Hermite normal form of M. o 

Proposition 2.12. We have that x^^ = (x^)-^ for any A, 5 gZ"^". Also, for any field K, 
the map defined by m{x)=x^ , for any unimodular matrix U gZ"-^"-, is an automorphism of 
{K*Y. ■ 

Theorem 2.13. [Stonni Ch. 6, Table 6.2, pg. 94] For any A = [a,j] G Z"^"^ with m>n, 
the Hermite factorization of A can be computed within 0(nm^'^'^^ log^(mmaxjj |ajj|)) bit 
operations. Furthermore, the entries of all matrices in the Hermite factorization have bit 
sz2:e 0(mlog(mmaXjj |ajj|)). ■ 



The following result is a very special case of work of Nesterenko that dramatically refines 
Baker's famous theorem on linear forms in logarithms |Bak77] . 



8 O. BASTANI, C. HILLAR, D. POPOV, AND J. M. ROJAS 

Theorem 2.14. (See |Nes03l Thm. 2.1, Pg. 55]. j For any integers 71, ai, . . . , 7Ar, a^v with 



1 



A(7,a) 



tti >2 for all i, define A(7, a) :=7i log(ai)+- ■ ■+7Ar log(a7v)- Then A(7, a) 7^0 =^ log 

N 

IS hounded above by 2.9{N + 2f/^{2eY^^^{2 + logmaxj |7j|) H logaj. ■ 

The most obvious consequence of lower bounds for linear forms in logarithms is an efficient 
way to determine the signs of monomials in integers. 

Algorithm 2.15. 

Input: Positive integers ai,ui, ... ,aM,UM and /3i,vi, . . . , (3n,vn with ai, /3i>2 for all i. 

Output: The sign of ■ ■ ■ a^f - f3l' ■■■1^7- 

Description: 

(0) Check via gcd-free bases (see, e.g., [BS961 Sec. 8.4]; whether • • ■ a"f = /5;^i ' ' ' 1^7 ■ 
If so, output "They are equal." an(i STOP. 

(1) Let U : = max{ni, . . . , um-, ^i, • • • , ^a^} and 

E:=^^{2ef'''+^^+\l + \ogU) x (flloga,,^ (ft log a). 

(2) For all i 6 [M] (resp. i G [N]), let Ai (resp. Bi) be a rational number agreeing with 
logOj (resp. XogPi) in its first 2 + E + log2 M (resp. 2 + E + log2 N) leading bits^ 

/M \ \ 

(3) Output the sign of ( ^WjA 1 — ( ^t>ji?.j 1 and STOP. 

Lemma 2.16. Algorithm \2.15\ is correct and, following the preceding notation, runs within 
a number of bit operations asymptotically linear in 

(M + iV)(30)*^+^L(log?7) fn^(log«,)) fn^(logA 

w/iere L(x) :=x(logx)^ loglogx. ■ 



i=l / \i=l 



Lemma 12.161 follows immediately from Theorem I2.14[ the well-known fast iterations for 
approximating log (see, e.g., |Ber03] ). and the known refined bit complexity estimates for 
fast multiplication (see, e.g., |BS96l Table 3.1, pg. 43]). 



3. Chamber Cones and Polyhedral Models 
3.1. Defining and Describing Chamber Cones. 

Definition 3.1. Suppose X cW^ is convex and Q^X is the polyhedral cone consisting of 
all c G M™ with c + X C X . We call Q the recession cone for X and, if p E M™ satisfies 
(1) p + Q^X and (2) p + c + Q ^ X for any ceQ\ {O}, then we call p + Q the placed 
recession cone. In particular, the placed recession cone for Log|C| for C an outer chamber 
(resp. a reduced outer chamber) is called a chamber cone (resp. a reduced chamber conej of 
Va- We call the facets of the (reduced) chamber cones of V a (reduced) walls of Va- We 
also refer to walls of dimension 1 as rays, o 



■^For definiteness, let us use Arithmetic- Geometric Mean Iteration as in |Ber03j to find these 
approximations. 
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Example 3.2. Returning to Example \2.5[ let us 
draw the rays that are the boundaries of the 4 re- 
duced chamber cones: While there appear to be just 
3 reduced chamber cones, there are in fact 4, since 
there is an additional (slender) reduced chamber 
cone with vertex placed much farther down. (The 
magnified illustration to the right actually shows 2 
nearly parallel rays going downward, very close to- 
gether.) Note also that reduced chamber cones need 
not share vertices, o 

Chamber cones are well-defined since chambers are log-convex, being the domains of con- 
vergence of a particular class of hypergeometric series (see, e.g., |GKZ94[ Ch. 6]). A useful 
corollary of the Horn-Kapranov Uniformization is a surprisingly simple and explicit descrip- 
tion for chamber cones. 

Definition 3.3. Suppose AgTj^ has cardinality n + 3, is not contained in any {n — l)-flat, 
and is not a pyramid^ Also let B be any real (n + 3) x 2 matrix whose columns are a basis 
for the right null space of A and let . . . , /3n+3 be the rows of B. We then call any set of 
indices Jc{l,...,n-|-3} satisfying: 

(a) [AjieJ is a maximal rank 1 submatrix of B. 

(b) ^jgj A is not the zero vector, 
a radiant subset corresponding to ^1. o 

Theorem 3.4. Suppose ^CZ" has cardinality n -\-3, is not contained in any {n — l)-flat, 
is not a pyramid, and Vyi is a hypersurface. Also let B be any real (n + 3) x 2 matrix 
whose columns are a basis for the right null space of A and let Pi, ... , Pn+3 be the rows of B. 

^ J' B^ and let s,- denote the row vector whose j— coordinate 
10 

is or log \ sij\ according or not. Then each wall of Vyi is the Minkowski sum of 

the row-space of A and a ray of the form Si — IR+ Xljgj /^'^ some unique radiant subset J 
of A and any zG J. In particular, the number of walls ofVA, the number of chamber cones 
of Va, (ind the number of radiant subsets corresponding to A are all identical, and at most 
n + 3. 



Finally, let S := [sij] : = B 



Note that the hypotheses on A are trivially satisfied when A gIj has cardinality 4. Note 
also that the definition of a radiant subset corresponding to A is independent of the chosen 
basis B since the definition is invariant under column operations on B. 



Remark 3.5. Theorem 3.4\ thus refines an earlier result of Dickenstein, Feichtner, and 
Sturmfels ( |DFS07l Thm. 1.2]) where, in essence, unshifted variants of chamber cones (all 
going through the origin) were computed for nonpyramidal A G of arbitrary cardinality. 
A version of Theorem \3.4\ for A of arbitrary cardinality will appear in |PRRT11] . o 

Example 3.6. It is easy to show that a generic A satisfying the hypotheses of our theorem 
will have exactly n-\-3 chamber cones, e.g., Example \2.5[ It is also almost as easy to construct 



examples having fewer chamber cones. For instance, taking A :- 








1 




2 









2 















1 


1 




1 



*The last assumption simply means that there is no point aEA such that A \ {a} lies in an (n — l)-flat. 
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we see that B :- 



-1 o 

2 1 

-2 —2 

1 

1 



satisfies the hypotheses of Theorem \3.4\ and that {1,5} is a non- 



multiple of Pi 
f modulo a mu 



radiant subset. So the underlying Vyi has only 3 chamber cones, o 

Proof of Theorem 13. 4t First note that by CoroUary 12.6^ Amoeba(Ayi) is the Minkowski 

sum of and the row space of A, where ip{t) := Log So then, determining the 

walls reduces to determining the directions orthogonal to the row space of A in which ip{t) 
becomes unbounded. 

Since 1 := (1, . . . , 1) is in the row space of A, we have 1B = and thus (p{t) = ip(t/M) for 
all M>0. So we can restrict to the compact subset {(^1,^2) I l^iP + |^2p = l} and observe 
that ip{t) becomes unbounded iff tf3'[ — > for some i. In particular, we see that there are 
no more than n + 3 reduced walls. Note also that tpf — > iff t tends to a suitable (nonzero) 

multiple of /3i ^ ^ , in which case the coordinates of ip{t) becoming unbounded are exactly 

those with index j G J where J is the unique radiant subset corresponding to those rows of B 
that are nonzero multiples of hi. (The assumption that A not be a pyramid implies that B 
can have no zero rows.) Furthermore, the coordinates of ip{t) that become unbounded each 
tend to —00. Note that Condition (b) of the radiance condition comes into play since we are 
looking for directions orthogonal to the row-space of A in which ip{t) becomes unbounded. 

We thus obtain that each wall is of the asserted form. However, we still need to account 
for the coordinates of (p{t) that remain bounded. Now, if t tends to a suitable (nonzero) 

^ ^ , then it clear that any coordinate of ip{t) of index j ^ J tends to Sij 

tiple of 1 added to f(t)). So we have indeed described every wall, and given 
a bijection between radiant subsets corresponding to A and the walls of Va- 

To conclude, note that the row space of A has dimension n + 1 by construction, so the 
walls are all actually (parallel) n-plane bundles over rays. So by the Archimedean Amoeba 
Theorem, each outer chamber of Va must be bounded by 2 walls and the walls have a natural 
cyclic ordering. Thus the number of chamber cones is the same as the number of rays and 
we are done. ■ 

3.2. Which Chamber Cone Contains Your Problem? An important consequence of 
Theorem 13. 41 is that while the underlying yi- discriminant polynomial A^i may have huge coef- 
ficients, the rays of a linear projection of Amoeba(Ayi) admit a concise description involving 
few bits, save for the transcendental coordinates coming from the "shifts" Sj. By applying 
our quantitative estimates from Section 12.31 we can then quickly find which chamber cone 
contains a given n-variate (n -|- 3)-nomial. 

Theorem 3.7. Following the notation of Theorem 8.4, suppose /G S^yiflMfxi, . . . and let 
T denote the maximum bit size of any coordinate of A. Then we can determine the unique 
chamber cone containing f — or correctly decide if f is contained in 2 or more chamber 
cones — within a number of arithmetic operations polynomial in n + r. Furthermore, if 
/ G 3vi n . . . , x„], a is the maximum bit size of any coefficient of f , and n is fixed, then 
we can also obtain a bit complexity bound polynomial in r + a. ■ 

Theorem 13.71 is the central tool behind our complexity results and follows immediately 
upon proving the correctness of (and giving suitable complexity bounds for) the following 
algorithm: 
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Algorithm 3.8. 

Input: A subset A gIj"^ of cardinality n + 3 (with A not a pyramid and not contained in any 
{n — l)-flat, and Vyi a hypersurface) and the coefficient vector c of an 

/e3^AnM[xi,...,xJ. 

Output: Radiant subsets 3 and J' (corresponding to A) generating the walls of the unique 
chamber cone containing f , or a true declaration that f is contained in at least 2 
chamber cones. 

Description: 

(-5) (Preprocessing) Compute the Hermite Factorization = U'^A and let B be the 
submatrix defined by the rightmost 2 columns of U. 

(-4) (Preprocessing) Let (3i, . . . , (3n+3 be the rows of B, S := [sij] := B ^ ^ B^ , and 

let Si denote the row vector whose j— coordinate is or log \ sij\ according as %s 
or not. 

(-3) (Preprocessing) Find all radiant subsets Jc {1, . . . , n + 3} corresponding to A. 

(-2) (Preprocessing) For any radiant subset 3 let /3^:= — Ylj&P'j "^J denote the row 

vector SiB for any fixed iE3. 
(-1) (Preprocessing) Sort the in order of increasing counter-clockwise angle with the 

x-coordinate ray and let R denote the resulting ordered collection of Pj. 

(0) (Preprocessing) For any radiant subset 3 let vj G denote the intersection of the 
lines sj + ]R/3j and sjr + ]R/3j,, where (3y is the counter-clockwise neighbor of (i'^. 

(1) Let ConeC aunt -.= 0. 

(2) Via binary search, attempt to find a pair of adjacent rays of the form 
{vj + M_|_/3j, vj + M+zS^/) containing Log|c|i?. 

(a) // (ConeCount = and there is no such pair of rays) 
or 

(ConeCount = l and there is such a pair of rays) 

then output ' 'Your f lies in at least 2 distinct chamber cones.'' and 
STOP. 

(b) If ConeCount = and there is such a pair of rays, set ConeC ount\=ConeCount + 1, 
delete f3j and f3j, from R, and GOTO STEP (2). 

(3) Output ' 'Your f lies in the unique chamber cone determined by 3 and 3'.'' 
and STOP. 



Remark 3.9. An important detail for large scale computation is that the preprocessing steps 
(-5)-(0) need only be done once per support A. This can significantly increase efficiency in 
applications where one has just one (or a few) A and one needs to answer chamber cone 
membership queries for numerous f with the same support, o 

Proof of Correctness of Algorithm 13. 8t First note that the computed matrix B indeed 
has columns that form a basis for the right null-space of A. This is because our assumptions 
on A ensure that the rank of yl is n + 1 and thus the last 2 rows of consist solely of 
zeroes. 

By construction, Theorem 13.41 then implies that the Pj are exactly the reduced rays for Va, 
modulo an invertible linear map. (The invertible map arises because right-multiplication by 
B induces an injective but non-orthogonal projection of the right null-space of A onto M^.) 
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It is then clear that the preprocessing steps do nothing more than give us a -B suitable for 
Theorem 13.41 and a sorted set of reduced rays ready for chamber cone membership queries 
via binary search. In particular, since the reduced chamber cones cover M^, the correctness 
of Steps (l)-(3) is clear and we are done. ■ 

In what follows, we will use the "soft-Oh" notation 0*{f) to abbreviate bounds of the form 
0(/(log/)0«). 

Complexity Analysis of Algorithm 13. 8t Let us first approach our analysis from the 
more involved point of view of bit complexity. Our arithmetic complexity bound will then 
follow upon a quick revisit. 

By Theorem 12.131 (which quotes bounds from |StoOOj ). Step (-5) takes 0(?7,^-^^^r^) bit 
operations. Also, the resulting bit size for the entries of B is 0{nT). 

The complexity of Step (-4) is negligible, save for the approximation of certain logarithms. 
The latter won't come into play until we start checking on which side of a ray a point lies, 
so let us analyze the remaining preprocessing steps. 

Step (-3) can be done easily through a greedy approach: one simply goes through the 
rows (32, ... , (3n+3 to see which rows are multiples of f3i. Once this is done, one checks if the 
resulting set of indices is indeed radiant or not, and then one repeats this process with the 
remaining rows of B. In summary, this entails O(n^) arithmetic operations on number of 
bit-size Oinr), yielding a total of 0*{Tn?r) bit operations. 

Step (-2) has negligible complexity. 

The comparisons in Step (-1) can be accomplished by computing the cosine and sine of the 
necessary angles via dot products and cross products. Via the well-known asymptotically 
optimal sorting algorithms, it is then clear that Step (-1) requires O(nlogn) arithmetic 
operations on integers of bit size 0{nT), contributing a total of 0*(r2^r log n) bit operations. 

Step (0) has negligible complexity. 

At this point, we see that the complexity of the Preprocessing Steps (-5)-(0) is 0{n^'^'^^r'^) 
bit operations. 

Continuing on to Steps (l)-(3), we now see that we are faced with 0(log?7,) sidedness com- 
parisons between a point and an oriented line. More precisely, we need to evaluate O(logn) 

Log|c|5 - S3 

thanks to Algorithm [215] and Lemma EEl takes O(n302"+5L(a + nT)L{aY+^L{nTY+'^) bit 
operations. 

We have thus proved our desired bit complexity bound which, while polynomial in r -|- a 
for fixed n, is visibly exponential in n. Note however that the exponential bottleneck occurs 
only in the sidedness comparisons of Step (2). 

To obtain an improved arithmetic complexity bound, we then simply observe that the 
sidedness comparisons can be replaced by computations of signs of differences of monomials, 
simply by exponentiating the resulting linear forms in logarithms. Via recursive squaring 
[BS961 Thm. 5.4.1, pg. 103], it is then clear that each such comparison requires only O(n^r) 
arithmetic operations. So the overall number of arithmetic operations drops to polynomial 
in n + r and we are done. ■ 



signs of determinants of matrices of the form 



Each such sign evaluation. 



Let us now state some final combinatorial constructions before fully describing how cham- 
ber cones apply to real root counting. 
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3.3. Canonical Viro Diagrams and the Probability of Lying in Outer Chambers. 

Our use of outer chambers and chamber cones enables us to augment an earlier construction 
of Viro. Let us first recall that a triangulation of a point set A is simply a simplicial complex 
S whose vertices lie in A. 

Definition 3.10. We say that a triangulation of A is coherent iff its maximal simplices are 
exactly the domains of linearity for some function i that is convex, continuous, and piecewise 
linear on the convex hull of A. In particular, we will sometimes define such an i by fixing 
the values i{a) for just those aEA and then employing the convex hull of the points (a,£(a)) 
as a ranges over A. The resulting graph is known as the lower hull of the lifted point set 
{(a,£(a)) I aeA}. o 

Definition 3.11. (See Proposition 5.2 and Theorem 5.6 of |GKZ94l Ch. 5, pp. 378-393].; 
Suppose A (ZT/^ is finite and the convex hull of A has positive volume and boundary OA. 
Suppose also that A is equipped with a coherent triangulation E and a function s : A — > {±} 
which we will call a distribution of signs for A. We then call any edge with vertices of opposite 
sign an alternating edge, and we define a piece-wise linear manifold — the Viro diagram 
Vyi(S, s) — in the following local manner: For any n-cell Cg S, let Lc be the convex hull of 
the set of all midpoints of alternating edges ofC, and then define s) := IJ Lc\dA. 

C an n-cell 

When yi = Supp(/) and s is the corresponding sequence of coefficient signs, then we also call 
^T.{f) '■ = '^a{^, s) the Viro diagram of / corresponding to S. o 

Example 3.12. Consider /(x) ■.= l — xi — X2 + 3xfx2 + 3x1^2 . Then Supp(/) = {(0, 0), (1, 0), 
(0, 1), (1, 4), (4, 1)} and has convex hull a pentagon. So then there are exactly 5 coherent 
triangulations, yielding 5 possible Viro diagrams for f (drawn in thicker green lines): 




Note that all these diagrams have exactly 2 connected components, with each component 
isotopic to an open interval. Note also that our f here is a 2-variate (2 + 3)-nomial. o 

Definition 3.13. Suppose AdV^ has cardinality n + 3, is not a pyramid, is not contained 
in any {n — l)-flat, and Va is a hypersurface. Also let B be any real (ra + 3) x 2 matrix 
whose columns are a basis for the right null space of A. For any f ^3^^ let us then define 
v{f) := (wi(/), . . . ,w„+3(/)) := (Xliej + (^Xljeg^i) '^here 3 and S are the unique radiant 
subsets corresponding to the unique chamber cone containing Log|c|. (We set v{f) := O 
should there not be a unique such chamber cone.) Let us then define ArchNewt(/) to be the 
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convex hull of {{ai,Vi) | z G {1, . . . , n + 3}} and let denote the triangulation of A induced 
by the lower hull o/ ArchNewt(/). ^4/50, we call any polynomial of the form ^ CiX""^ — 

with Q a cell ofHf — a canonical lower polynomial for f . Finally, we call V(/) :=Ve(/) 
the canonical Viro diagram of f . o 

Example 3.14. Let /(xi) ■.= l-\x^^^+x^^^-2x^^^ , c:= (l, -\, 1,2), andA:= {0, 404, 405, 808} 
A routine calculation then reveals that {{2}, {3}} is the pair of radiant subsets corresponding 
to the unique chamber cone containing Log|c|. We then obtain that v{f) = (0,1,1,0) and 

thus ArchNewt(/) is — '''''^^~~~~^^ » (modulo some artistic stretching). In particular, 
thus has the single cell [0, 808], which is an alternating cell, and thus V{f) consists of a single 
point. More than coincidentally, f has exactly 1 positive root, o 

Example 3.15. Returning to Example \3.12\ let c:= [1, —1, —1,3,3) . A routine calculation 
then reveals that the unique chamber cone containing Log|c| is defined by the pair of radiant 
subsets {{2}, {3}}. So then v{f) = (0,1,1,0,0) and is the exactly the upper middle 
triangulation from the illustration of Example \3.12[ V(/) then consists of 2 disjoint open 
intervals and, more than coincidentally, the positive zero set of f has exactly 2 connected 
components, each homeomorphic to an open interval, o 

Theorem 3.16. Following the notation above, set ft{x) := ^"^f Qt'^'^-^-'x"' and assume in 
addition that A Cl Q has cardinality n for all facets Q of Convyi. Then c := (ci, . . . , c^+s) 
lies in an outer chamber =^ the positive zero sets of ft, as t ranges over (0,1], are each 
diffeotopic to the positive zero set of fi. In particular, fi = f and thus, when c lies in an 
outer chamber, the positive zero set of f is isotopic to V(/). 

Remark 3.17. Forn = l we thus obtain that the number of positive roots of the tetranomial 
f is exactly the cardinality of its canonical Viro diagram, o 

Proof: By construction, the image of Log| (cit^'^'--^^ . . . , Cn+3t'""+^^^'')\ as t ranges over (0, 1] 
is simply a ray entirely contained in a unique chamber cone. Moreover, by assumption (and 
since outer chambers are log convex), the ray is also contained entirely in Log| ■ | of an outer 
chamber. So the first part of our theorem follows from Lemma 12.101 

The final part of our theorem is then just a reformulation of Viro's Theorem on the isotopy 
type of toric deformations of real algebraic sets (see, e.g., |GKZ94l Thm. 5.6]). ■ 

The main contribution of our paper is thus an efficient method to associate a canonical 
Viro diagram to the positive zero set of a given / so that both manifolds have the same 
topology. Such a method appears to be new, although the necessary ingredients have existed 
in the literature since at least the 1990s. In particular, to the best of our knowledge, all 
earlier applications of Viro's method designed clever / having the same topology as some 
specially tailored Viro diagram, thus going in the opposite direction of our construction. 

We state up front that our method does not work for all /. However, our development 
yields a sufficient condition — outer chamber membership — that holds with high probability 
under the stable log-uniform measure. 

Theorem 3.18. Suppose A C has cardinality n + 3, is not a pyramid, is not contained 
in any {n — l)-flat, and Va is a hypersurface. Suppose also that the coefficients of an 
/ G 3\A n ]R[xi, . . . ,Xn] are independently chosen via the stable log-uniform measure over M 
(resp. Tj). Then with probability 1, f lies in some outer chamber. In particular, if we assume 
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in addition that AHQ has cardinality n for all facets Q of Convyi, then the positive zero set 
of f is isotopic to V{f) with probability 1. 

Proof: By Theorem I3.4[ Amoeba(Ayi) is an n-plane bundle over Amoeba(Ayi) , where 
Ayi G Z[a,b] and Ayi(ci, . . . , c^+s) = Ayi(a(c), /3(c)) for suitable monomials a and f3 in c. 
Furthermore, thanks to Corollary 8 of [PSTOSj . the complement of Amoeba(Ayi) has no 
bounded convex connected components. 

Letting c denote the coefficient vector of / we thus obtain that / lies in an outer chamber 
iff Log|c| ^ Amoeba(Ayi). In particular, it is clear by the Passare-RuUgard Theorem that 
in any large centered cube C, the volume of Amoeba(Ayt) fl C occupies a vanishingly small 
fraction of C. So the ffist assertion is proved. 

The final assertion is an immediate consequence of the first assertion and Theorem 13.161 

■ 

Theorem 11.41 then follows easily from Theorems 13.161 and 13.181 The applications of Theo- 
rems 13.161 and 13.181 to computational real topology will be pursued in another paper. 

3.4. Proving Theorem 11.41 Let us first consider the following algorithm for counting the 
positive roots of "most" real univariate tetranomial. 

Algorithm 3.19. 

Input: A tetranomial f eM.[xi] with support A. 

Output: A number in {0, 1, 2, 3} that, for any f in an outer chamber ofVj\, is exactly the 

number of positive roots of f . 

Description: 

(1) Via Algorithm \3.^ and any sub- quadratic planar convex hull algorithm (see, e.g., 
|USvKOO"] )■ compute the canonical Viro diagram V(/). 

(2) // / did not lie in a unique chamber cone then output ' 'Your / does not lie in 
an outer chamber so please use an alternative method.'' anc? STOP. 

(3) Output the cardinality ofV{f) and STOP. 

Assuming Algorithm 13. 191 is correct, we can count the real roots of / simply by applying 
Algorithm 13.191 to f{xi) and /(— Xi). Theorem 11.41 thus follows immediately upon proving 
the correctness of our last algorithm and a suitable complexity bound. We now do this. 

Proof of Correctness of Algorithm I3.19t By Theorem 13. 16^ the number of positive 
roots of / is exactly the cardinality of V(/) whenever / is in an outer chamber. So we indeed 
have correctness. ■ 

Complexity Analysis of Algorithm 13. 19t Let us first observe that Algorithm 13 . 1 91 indeed 
gives a correct answer with probability 1 (relative to the stable log-uniform measure): this 
is immediate from Theorem 13.181 

So now we need only prove a suitable complexity bound. Let us begin with the more refined 
setting of bit complexity: From our earlier complexity analysis of Algorithm 13. 8[ it is clear 
that Step (1) needs O(log^-D) + 0{L{h + log D)L{h)'^L {log D)^) bit operations, neglecting 
the computation of V{f). The complexity of computing V(/) (which is essentially dominated 
by computing the convex hull of 4 points with coordinates of bit size 0(log D)) is clearly neg- 
ligible in comparison. So we obtain a final bit complexity bound of O* ((/z -|- log D)h^ log^ . 
(The complexity of Steps (2) and (3) is negligible.) 

For arithmetic complexity, our earlier complexity analysis of Algorithm 13.81 specializes 
easily to an upper bound of 0{log^ D). (The speed-up arises from the easiness of checking 



16 



O. BASTANI, C. HILLAR, D. POPOV, AND J. M. ROJAS 



inequalities involving integral powers of real numbers in the BSS model over M.) So we are 
done. ■ 

Remark 3.20. It is important to note that when f lies in a chamber cone but not in any 
outer chamber, Alaorithm Vj.liA can give a wrong answer. However, thanks to Theorem \y.l8[ 
such an occurence has probability under the stable log-uniform measure, o 



To prepare for the proof, we first set up some notation. Fix positive integers i and m. Let 
P = (pij) G N^^™' be a matrix of natural numbers ordered as 



for i = 1, . . . , s and j = l,...,m — 1. Also, let aij be indeterminates over the same indexing 
set. Consider now the following polynomial: 



in which each nonzero gi{P) is a homogeneous (quadratic) polynomial in N[ajj]. Notice 
that at most im"^ of these g-i are nonzero. We will refer to the integer pij as the exponent 
corresponding to the coefficient aij. 

Lemma 4.1. The set of polynomials gi{P) arising from Sp{x) as P ranges over all i x m 
nonnegative integer matrices is finite. 

Proof: The form of Equation (14.11) allows for at most 2'"*^ possible such gi. ■ 

Suppose now that / G M[x] has degree d and is a sum of i squares, each involving at most 
m terms. Then, there is a set of exponents P and an assignment dij G M for the coefficients 
aij such that / = Sp{x). If we fix a set of exponents P for /, then conversely, any real point 
in the variety determined by the gi gives a representation of / as a sum of i squares, each 
involving at most m terms. 

We will prove Theorem 11.51 using contradiction by showing that a certain infinite family 
of trinomials cannot all have sparse representations of the form in Equality (14. ip . For this 
approach to work, however, we will need to find a universal set of coefficients d^j that 
"represents" all of them. 

Lemma 4.2. Let F C ]R[x] be an infinite collection of polynomials which are sums of i 
squares, each involving at most m terms. Moreover, suppose that the nonzero coefficients of 
polynomials f E F come from a finite set C . Then, there is an infinite subset /i, /2, . . . G -F 
of them with corresponding exponents Pi, P2, • • • G N^^"* and a single set of real coefficients 
dij such that fk = Sp^^{x)\a^^=aij for all k. 

Proof: Let /i, /2, . . . G -F be an infinite sequence of polynomials with corresponding expo- 
nents Pi,P2, . . . G H^^™. Also let S be the set of all possible (polynomial) coefficients of 
polynomials 5'p^(x); from Lemma [4.1^ this set is finite. By assumption, each f^ gives rise 
to a set of equations involving a subset of S and real numbers coming from the finite set 
C. The set of all such equations is again finite, and therefore, by the infinite pigeon-hole 



4. Proving Theorem 11.51 



Pii>P2i>---> Psi and Pij > Pi{j+i), 




(4.1) 



gd{P)x'' + ■ ■ ■ + gi{P)x + go{P) 
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principle, there is a subsequence of the fk which have the same set of equations governing 
their coefficients aij. Picking any real solution to such a set of equations finishes the proof. 

We now have all the tools we need to prove Theorem 11.51 However, to make the argument 
less cumbersome, we first recall "little-oh" notation: for a function /i : N — ?■ M, we say that 
h{n) = o{n) if 

, h(n) 
lim = 0. 

n— >oo n 

It is easy to see that the sum of any finite number of such functions is also o{n). Moreover, 
if lim^^oo = P for some constant p, then p{n) = np + o(n). 

Proof of Theorem II. 5t Suppose, by way of contradiction, that every positive definite 
trinomial can be written sparsely as the sum of i squares, each involving at most m terms. 
Consider the infinite sequence of positive definite trinomials, 

f, = x''^ + x''-' + l, k = l,2,.... (4.2) 

Using Lemma 1121 we can find a subsequence fk^ with corresponding exponents Pk^, Pk2, • . . G 
^exm. g^j^j single set of real numbers Ojj such that fk^ = S'p^.^ (x) \aij=aij for s = 1, 2, . . .. We 
will assume that the aij are chosen such that the number of them that are zero is maximal 
among all such sets of coefficients dij. Moreover, we shall assume that this maximality still 
holds if we restrict to any infinite subsequence of the fk,- For clarity of exposition in what 
follows, we will not keep updating the subscripting of indices when taking subsequences. 
Given an exponent matrix Pk^ G N^^'", define a new matrix 

Pk — ~, — Pk ■ 

Kg 

This corresponds naturally to the transformation x t— t- x^^''^ applied to both sides of an 
equation fk^i^) = Sp^^{x). Since deg(/fcj = 2ks, each matrix Pk^ has entries in the interval 
[0,1]. By compactness, we may choose a subsequence Pk^ such that Pk^ converges in the 
(entry-wise) Euclidean norm to a matrix P = {pij) G [0, 1]^^™. Clearly, we have pu = 1 
and also that some entry of P is 0; it turns out that these are the only possible numbers for 
entries of P which play a role in (14. ip . 
Claim: If pij ^ 0, 1, then dij = 0. 

Suppose, on the contrary, that P contains r > 2 real numbers, pi, . . . ,Pr with = pi < 
■ ■ ■ < Pr = 1- Each power of x occurring in a summand of Equality (14. ip . after squaring, is 
of the form 

ksPu + ksPv + o{ks). (4.3) 

Thus, for all sufficiently large s, the powers of x occurring in Equality (14. ip are partitioned 
into classes determined by the number of distinct values, 

Pu+Pv, u,v = 1,. . . ,r. 

Notice that the numbers in Formula (14. 3 p all become strictly smaller (resp. larger) than 
2ks — 1 (resp. 0) as s — )■ oo unless u = v = r (resp. u = v = 1). In particular, with s large, 
the polynomials gw{Pks) G N[aij] for 

w = 2ks + o{ks) and w = + o{k^) (4.4) 
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do not involve the indeterminates aij coming from exponents of the form ksPu + o{ks) with 
u ^ l,r. Conversely, each monomial in every polynomial gw{Pks) with w not in one of the 
classes from f l4.4p contains at least one such indeterminate aij. 

Since the only nonzero coefficients of the sequence (14. 2p come from the classes from (14.41) . it 
follows that we may replace with all coefficients aij corresponding to exponents ksPu + o{ks) 
with u ^ l,r and still have an equality 

The claim therefore follows from the maximality property of the chosen set of coefficients 

We next examine what all this says about the limiting behavior of the expressions from 
Equality (14. ip . From the claim, it follows that when s is large, we need only consider those 
exponents from the matrices Pk^ that are on the order 

kg + o{ks) and + o{ks)- 

Fix such a large s and consider the smallest exponent p on the order kg + o{ks) that occurs 
with a nonzero coefficient in (14. ip after substituting the for the aij. When the sum from 
(14. ip is expanded, the term x^^ will appear with positive coefficient; i.e., 

g2p{Pks)\ai^=aij 7^ 0. 

It follows from the form of the sequence (14.20 that p = kg (since the other nonzero term is 
odd). In particular, it is not possible to obtain a nonzero coefficient for x^^""^ in fk^. This 
contradiction completes the proof. ■ 
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